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I . PROBLEM  DISCUSSION 


Maintaining  the  quality  of  an  enclosed  environment  (e.g.,  aboard 
a submarine)  requires  monitoring  of  the  environment  by  means  of  sam- 
pling procedures.  One  general  type  of  sampling  procedure  is  "grab 
sampling",  in  which  atmospheric  samples  are  obtained,  essentially 
Instantaneously,  in  evacuated  flasks.  In  many  situations,  these  samples 
may  not  be  analyzed  for  days  or  months  (e.g.,  until  they  are  returned 
from  a submarine  to  a shore-based  laboratory).  Thus,  it  is  Important 
to  know  whether  such  a delay  affects  the  results  of  the  analysis,  or  in 
other  words,  whether  these  samples  have  a shelf  life. 

For  purposes  of  the  current  study,  four  particular  grab  sampling 
methods  were  of  interest.  These  methods  were  defined  by  differences 
in  preparation  and  filling  of  the  flasks.  One  question  of  major  Interest 
is  whether  any  of  these  methods  are  better  than  the  others.  Another  major 
question  is  whether  the  shelf  life  or  choice  of  sampling  method  is 
affected  by  the  concentration  of  contaminants  in  the  environment. 

In  order  to  obtain  information  on  which  to  base  an  answer  to  these 
and  other  questions,  a statistical  experiment  was  conducted.  This  experi- 
ment consisted  of  the  sampling  of  atmospheres  containing  eleven  selected 
contaminants.  The  experiment  was  designed  to  compare  the  four  grab  sam- 
pling methods  of  interest,  to  evaluate  the  shelf  life  of  the  grab  samples, 
to  measure  the  effects  of  contaminant  concentration,  and  to  Investigate 
the  sources  of  variability  in  the  overall  sampling  process. 

Eleven  contaminants,  listed  in  Table  1,  were  investigated  in  this 
study.  To  conserve  space  in  the  report,  they  will  for  the  most  part  be 
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#1 


Methane 


#2  Carbon  Monoxide 

#3  Dichlorodif luoromethane  (R-12) 

#4  1,  1,  2,  2-tetrafluoro-l,  2-dichloroethene  (R-114) 

#5  Vinylidene  Chloride 

#6  1,  1,  2-trichloro-l,  2,  2-trif luoroethane  (R-113) 

#7  Hexane 

#8  Methyl  Chloroform 

#9  Benzene 

#10  Trichloroethylene 

#11  Toluene 


Table  1:  Contaminants  Investigated  in  the  Experiment 
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referred  to  by  the  numbers  assigned  in  the  table.  For  purposes  of 

this  investigation,  each  contaminant  was  studied  at  a high  and  a low 

concentration  level.  The  specific  high  and  low  concentration  levels 

varied  with  contaminant.  In  general,  the  nominal  high  concentration 

« 

levels  were  in  the  range  60-110  ppm  and  the  nominal  low  concentration 
levels  were  in  the  range  5-12  ppm.  The  use  of  more  than  one  concentra- 
tion level  permits  the  estimation  of  any  differential  effects  which 
concentration  may  have  on  the  sampling  method  or  shelf  life. 

A number  of  sources  affect  the  results  obtained  from  a quantitative 
analysis  of  a particular  contaminant  in  a grab  sample.  For  purposes  of 
this  investigation,  the  following  contributing  sources  or  factors  have 
been  considered: 

(1)  Concentration  level 

(2)  Sampling  method 

(3)  Flask 

(4)  Measurement  (analysis) 

(5)  Time  to  analysis  (delay  time) 

In  addition,  the  interaction  effects  of  these  sources  were  also  investi- 
gated. 

Concentration  level,  which  is  self-explanatory,  is  in  itself  not  an 
important  experimental  factor,  since  high  concentrations  would,  of  course, 
be  expected  a priori  to  result  in  higher  observations  than  would  low 
concentrations.  However,  its  interactions  with  other  factors  are  impor- 
tant. For  example,  one  question  is  whether  one  sampling  method  provides 
the  best  results  for  high  levels  of  contaminant  concentrations  while 
another  method  provides  the  best  results  for  low  levels.  Another  question 
is  whether  different  concentration  levels  have  any  effect  on  shelf  life. 
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This  would  involve  interactive  effects  of  delay  time  with  concentration 
level. 

Four  sampling  methods  were  used  in  this  study.  These  methods  were 
defined  by  the  type  of  flask  preparation  (vacuum  only  or  vacuum-bake) 
and  the  procedure  for  filling  the  flask  (single  fill  or  purge  fill)  as 
follows: 

Sampling  Method  #1:  Vacuum-bake,  Single  fill 

Sampling  Method  #2:  Vacuum  only,  Single  fill 

Sampling  Method  #3:  Vacuum-bake,  Purge  fill 

Sampling  Method  // 4:  Vacuum  only,  Purge  fill 

In  addition  to  samples  obtained  by  these  methods,  in-situ  measurements 
were  also  obtained. 

Because  each  sample  was  obtained  in  a different  flask,  it  is  realistic 
to  assume  that  each  individual  flask  will  contribute  to  the  variation 
observed  in  the  sampling  results.  This  variation  may  result,  for  example, 
from  differential  effects  which  occur  in  preparing  or  filling  the  flask 
or  from  particular  characteristics  of  an  individual  flask.  This  variation 
may,  in  general,  be  regarded  as  experimental  error. 

Another  source  of  variation  that  enters  into  the  experimental  results 
is  measurement  error,  i.e.,  the  differences  in  repeated  analyses  of  the 
same  flask.  To  provide  an  estimate  of  this  variation,  duplicate  analyses 
(measurements)  were  made  on  the  contents  of  each  flask. 

To  study  any  possible  effects  that  delay  in  analysis  of  the  sample 
may  have,  it  was  decided  to  study  these  effects  over  the  period  from  the 
time  sampling  occurs  until  seventy  days  have  elapsed.  This  was  accomplished 
by  analyzing  samples  at  ten  day  intervals  during  this  period. 
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The  study  described  in  this  report  was  a joint  effort.  The  grab 
sampling  and  associated  chemical  analyses  were  conducted  by  the  Chemistry 
Division,  Naval  Research  Laboratory.  The  statistical  analysis  was  per- 
formed by  Desmatics,  Inc. 


II.  EXPERIMENTAL  DESIGN 


The  object  of  the  experiment  described  in  this  report  was  to  obtain, 
within  existing  operational  constraints,  the  most  information  on  the 
sources  of  variation  discussed  in  the  previous  section.  Any  experiment, 
when  set  up  and  analyzed,  makes  use  of  certain  assumptions.  A major 
assumption  is  the  underlying  model,  which  affects  not  only  specification 
of  the  data  collection  process,  but  also  the  final  statistical  analysis. 
The  following  sections  discuss  salient  details  of  the  experiment. 


A.  CONSTRAINTS  ON  THE  EXPERIMENT 

The  apparatus  used  for  obtaining  the  grab  samples  in  this  experiment 
was  a manifold.  Flasks  were  attached  to  ports  on  the  manifold,  which  was 
then  filled  with  a specified  concentration  level  of  the  eleven  contamin- 
ants. To  prevent  any  unanticipated  biases  in  an  experiment,  it  is  desirable 
to  completely  randomize  the  order  of  sampling  relative  to  all  of  the  factors 
considered.  However,  it  is  easily  seen  that  requiring  concentrations  to 
be  randomized  would  involve  an  extraordinary  amount  of  time  and  expense. 

It  is  more  reasonable  from  an  operational  standpoint  to  obtain  all  samples 
for  a given  concentration  before  samples  are  obtained  for  another  concen- 
tration. 

By  imposing  this  constraint  on  the  randomization  in  the  experiment, 
a split-plot  design  results.  This  design  owes  its  name  to  the  development 
of  statistical  experimentation  in  the  agricultural  fields  where  a single 
level  of  a treatment  (e.g.,  a given  type  of  fertilizer)  is  applied  to  a 
large  plot  of  ground,  while  all  levels  of  other  treatments  (e.g.,  types 


-6- 


of  seeds)  were  applied  at  random  within  that  plot. 

Because  of  the  constraints  on  the  randomization,  the  actual  effect 
of  concentration  is  not  estimated  very  accurately.  However,  as  mentioned 
previously,  it  is  not  the  singular  effects  of  this  factor  that  are  impor- 
tant, but  rather  the  interactive  effects  with  sampling  method  and  time 
delay. 

To  study  the  effects  of  time  delay  and  still  keep  the  number  of 
samples  within  reasonable  limits,  it  was  decided  to  analyze  each  flask 
at  intervals  over  a seventy  day  period,  rather  than  analyzing  each  flask 
at  only  one  time.  Thus,  the  results  obtained  for  any  time  are  not  indepen- 
dent of  the  results  at  another  time  because  they  are  both  based  on 
measurements  from  the  same  flask.  This  adds  another  randomization  restric- 
tion, repeated  measures,  on  the  experiment. 

B.  THE  UNDERLYING  STATISTICAL  MODEL 

The  usual  model  assumed  in  an  experimental  situation  is  one  in  which 
a given  observation  y is  a composite  of  a number  of  additive  effects. 

For  example,  in  the  current  situation,  the  usual  model  (taking  the 
constraints  on  randomization  into  account)  would  be: 

yijkmnp  “ M + ci  + pj(i)  + sk  + CSik  + PSjk(i)  + Fm(ijk) 

+ Vijta)  + TP  + "ip  + STkp  + CSTikp  + PTjP(i) 

+ PSTJkp(i)  + FTmp(ijk)  + Gnp(ijkm)  ^ 
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where  the  capital  letters  refer  to  the  sources  of  the  effects.  That  is, 

M denotes  an  overall  mean  effect 
C denotes  the  concentration  level  effect 
P denotes  the  plot  effect 
S denotes  the  sampling  method  effect 
F denotes  the  flask  effect 
A denotes  the  analysis  effect 
T denotes  the  time  effect 
£ denotes  a random  error  effect. 

The  two-  and  three-letter  combinations  in  the  model  indicate  the  inter- 
active effects  between  the  various  factors,  while  the  subscripts  denote 
the  specific  level  of  any  factor  or  factor  combination.  A subscript 
expression  containing  parentheses  indicates  that  the  particular  factor 
levels  outside  the  parentheses  are  nested  in  a hierarchical  relationship 
within  the  factor  levels  inside  the  parentheses.  For  example,  PT^^ 
indicates  the  nesting  of  the  jth  plot  and  pth  time  within  the  ith  concen- 
tration level. 

An  implicit  assumption  underlying  the  statistical  analysis  based  on 
this  model  is  that  the  random  error  term  e has  the  same  variance  for  every 
observation.  In  general,  this  assumption  is  not  critical  unless  there  is 
extreme  inequality  of  variance.  Unfortunately,  this  point  might  be 
approached  in  the  current  situation,  since  variation  may  be  expected  to  be  on 
a proportional,  rather  than  an  absolute,  basis.  That  is,  variation  would 
be  expected  to  be  roughly  proportional  to  the  true  contaminant  concentration 
level.  Therefore,  observations  for  low  concentration  levels  would  be  expected 
to  have  much  less  variability  in  an  absolute  sense  than  observations  for  high 
concentrations,  and  this,  in  fact,  is  what  the  data  shows. 

To  prevent  this  extreme  heterogeneity  of  variance  from  adversely 
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affecting  the  results  of  any  resulting  statistical  analysis,  there  are 
two  alternatives.  The  first  is  to  analyze  the  results  for  high  and  low 
concentrations  separately.  This  approach,  however,  precludes  obtaining 
information  on  the  interactive  effects  of  concentration  levels  with  other 
factors.  The  second  alternative  is  to  use  an  appropriate  transformation 
to  stabilize  the  error  variance.  If,  instead  of  analyzing  the  observations 
yijkmnp  directly,  t'le  transformed  logarithmic  observations 

Wi j kmnp  ln  yi j kmnp 

are  analyzed,  the  resulting  error  variance  structure  will  be  relatively 
stable  so  that  the  overall  experiment  involving  both  concentration  levels 
may  be  considered  in  a single  analysis.  It  should  be  noted  than  an 
additive  model  in  the  transformed  observations  (the  w's)  implies  a multi- 
plicative model  in  the  y's. 

In  general,  the  existing  error  structure  of  the  problem  indicates 
that  analyzing  the  transformed  observations  should  provide  the  optimal 
amount  of  information.  However,  because  there  may  be  interest  in  the 
results  of  an  analysis  conducted  on  the  untransformed  observations,  per- 
tinent information  regarding  this  analysis  is  also  included. 


C.  CONDUCTING  THE  EXPERIMENT 

As  mentioned  previously,  the  constraints  imposed  on  the  experiment 
made  a split-plot  arrangement  involving  repeated  measures  and  nesting 
the  most  appropriate  design.  Although  this  design  is  not  one  found  in 
a textbook,  it  may  be  obtained  by  imbedding  repeated  measures  with  nesting 
into  a split-plot  design.  To  provide  reasonable  degrees  of  freedom  for 
testing  for  statistically  significant  sources  of  variation,  three  replicates 
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of  this  design  were  used. 


To  insure  an  analysis  free  of  unintentional  bias,  the  actual  grab 
sampling  and  subsequent  analysis  were  randomized  within  the  existing 
constraints.  Specifically,  the  position  of  flasks  (corresponding  to 
the  given  sampling  types)  was  randomized  on  the  manifold  and  the  order 
of  analysis  at  each  time  was  randomized.  This  was  accomplished  by  the 
experimenter  following  a set  of  randomization  plans  which  were  obtained 
from  a random  number  table. 


III.  STATISTICAL  ANALYSIS 


A summary  of  the  data  collected  In  the  experiment  is  given  in 
Table  2 through  Table  12.  The  statistical  analysis  of  this  data  consists 
of  a number  of  steps,  which  are  described  in  the  following  sections. 


A.  CONSIDERATION  OF  SUSPICIOUS  DATA 

As  in  most  experimental  studies  of  this  magnitude,  there  was  suspi- 
cious data  in  this  experiment.  This  suspiciousness  existed  because  this 
particular  data  appeared  extremely  unrepresentative  of  other  data  taken 
under  similar  conditions.  Actually,  there  were  three  sets  of  suspicious 
data.  An  assignable  cause  for  the  first  set  was  easily  found  to  be  a 
slippage  of  the  decimal  point  in  the  data  cards  supplied  for  statistical 
analysis.  Thus,  this  set  of  suspicious  data,  which  was  erroneous,  was 
easily  corrected.  Only  this  corrected  data  is  included  in  the  tables. 
Because  no  specific  assignable  causes  were  found  for  the  other  two  sets 
of  suspicious  data,  the  tables  contain  the  data  as  recorded. 

A situation  more  complex  than  slippage  of  the  decimal  point  existed 
in  the  second  set  of  data,  which  corresponded  to  low  concentration  levels. 
The  suspiciousness  of  this  data  was  traced  to  the  quantitative  analyses 
of  one  flask  which  yielded  extremely  low  observations  for  most  contami- 
nants and,  in  fact,  readings  of  zero  for  contaminant  // 8 (methyl  chloroform). 
The  second  flask,  however,  did  not  confirm  these  extreme  observations. 
Because  of  this,  an  abnormally  large  amount  of  variability  exists  in  the 
results.  This  can  be  seen,  for  example,  by  examining  the  low  concentra- 
tion measurements  in  plot  #1  of  Table  9. 


PLOT  1 PLOT  2 PLOT  3 


0- 

in  in 

ro  av 

r~  o' 

co  cn 

P*  O 

cn  a 

r-  rn 

p-  r- 

CD  O' 

CN  av 

a CD 

in  ao 

cd  r~ 

cn  m 

in  cn 

• • 

• • 

• • 

0 • 

• • 

• 1 

0 0 

0 0 

O O' 

O'  CD 

O CD 

r-  CO 

O'  ao 

O CD 

CM  00 

O X 

r- 

U3 

r* 

p- 

r- 

p~ 

p' 

o m 

00 

*-  p*. 

V—  T~ 

in  m 

1C  o 

^ T- 

X 

CO 

o>  a 

f"  SO 

oo  o 

lO  O' 

CM  vO 

M3  O 

O'  M3 

a co 

• • 

• • 

• • 

• 9 

• • 

0 • 

0 0 

0 0 

£ 

O O' 

<p  00 

oo  O' 

O CD 

••  CO 

O CP 

r*  OD 

© CO 

4J 

r- 

VO 

lO 

r- 

r- 

O' 

P- 

r~ 

i 0) 

X 

00 

c 

st  r- 

v-  VC 

vP  r- 

oo  p* 

\C  r- 

o «- 

0-  X 

m o 

00 

a o 

ao  oo 

o ao 

m x 

9 00 

in  rr> 

CN  T- 

CM 

• • 

• • 

• * 

• • 

• • 

• 0 

0 0 

9 9 

f-H 

o cp 

x ao 

o ao 

O oo 

o ao 

r-  X 

co  eo 

O X 

i 

VO 

r- 

r- 

p> 

p- 

v-< 

(y 

9 

C/5 

3 

o 

r~  cm 

O 00 

ro 

r-  r- 

00  CM 

r-  CD 

CP  ' c 

® m 

X CN 

<N  X 

O O' 

O O' 

cm  r~ 

ro  r- 

oo  m 

C0  CM 

V-C 

i—4 

• • 

• « 

• • 

• 0 

0 t 

0 0 

0 • 

0 0 

s 

O X 

O 03 

x ao 

O 00 

CP  00 

o ao 

o ao 

o oo 

r- 

vP 

M3 

p* 

r- 

vj 

X T- 

C'  P' 

CN 

CN 

o m 

VO  ^ 

O OV 

CN  X 

c 

CTJ 

o CD 

p-  p- 

r1  X 

ro  cd 

it 

o cp 

00  00 

O X 

• • 

0 • 

• • 

• • 

• • 

0 0 

0 0 

0 0 

cm  ac 

cn  ao 

CM  00 

cm  ao 

f—  00 

CN  CD 

o oo 

CD 

0) 

p- 

p- 

r~ 

r~ 

r- 

p* 

p* 

n- 

*-  a 

0D  5* 

r~  O' 

ro  r- 

X)  1/5 

9 00 

X VC 

o <p 

(Xm 

O 

CD  VO 

ao  r* 

CN  vP 

in  in 

oo  r- 

X ^ 

x in 

•C 

m 

• • 

• • 

9 9 

• • 

• • 

0 0 

0 0 

0 0 

»-• 

4-» 

CM  UL> 

*-  ao 

cn  ao 

cm  ao 

r- 

0-  X 

O O' 

X 

0) 

0) 

X 

r~ 

p- 

r- 

r~ 

r~- 

r* 

P- 

P- 

P> 

00 

c 

CM  CM 

x ao 

cd  ac 

oo  ao 

O'  r- 

CN  CN 

r-  m 

x X 

•H 

no  in 

<N  X 

m in 

io  io 

in  •- 

cc 

cn  rn 

it  rn 

x: 

X 

CM 

• • 

• • 

9 9 

• t 

0 0 

9 9 

0 0 

• 0 

s* 

CM  O' 

CM  CO 

CM  O' 

cn  ac 

r-  O' 

r-  00 

© CP 

O X 

32 

3 

r~ 

1" 

p- 

p- 

r~ 

r> 

C/5 

P 

o 

no  r- 

X ro 

r-  in 

CN  O 

vn  ct 

:t  in 

CM  CM 

m x 

VM 

VC  go 

ro  x 

ro  r- 

p'  O' 

it  o 

in  r- 

X 

x X 

• • 

• • 

• 0 

# • 

• • 

0 • 

0 0 

0 0 

e 

»-  00 

r-  00 

*-  x 

cm  ac 

<T 

CN  CC 

O X 

O X 

P- 

r- 

p* 

r~ 

r* 

Pu 

C 

r-  f- 

a CM 

it  VO 

cn  cr 

r'  vn 

X rn 

v£  IO 

•H 

O X 

ro  *— 

X CD 

O lO 

O CD 

it  r- 

x 0- 

x 

<t 

« • 

• • 

• 0 

0 • 

9 9 

0 0 

0 0 

0 0 

C fl 

i n eo 

a 

CD 

cm  an 

cm  on 

r-  O' 

CN  CT 

0-  X 

r~ 

r 

n- 

r- 

p 

0) 

e 

o oc 

r-  r- 

o o 

m r* 

ro  o 

a-  cn 

cr  x 

x in 

d) 

p 

c-  \o 

O'  \C 

X 0“ 

vD  GO 

O O' 

CD 

m x 

m it 

x: 

• • 

• • 

• • 

0 • 

• t 

0 • 

0 0 

• 0 

co 

ro  ac 

o ac 

c ys 

cn  ao 

CN  00 

t-  o 

ro  X 

CM  O' 

<u 

V 

X 

p* 

r- 

r- 

r- 

r- 

r: 

<y 

00 

c 

ro  o 

CN  ^ 

o 

CN  St 

vn  ro 

cm  in 

m cn 

m ® 

co 

ctf 

it  X 

in  ® 

as 

o in 

r-  r- 

a a 

O P~ 

m x 

p 

CM 

• • 

• • 

• • 

0 0 

0 0 

0 • 

« 0 

0 • 

0i 

cx. 

ro  ao 

r-  ao 

ao 

cn  00 

CN  X 

r-  X 

rn  X 

CN  X 

> 

§ 

r- 

r- 

r* 

r- 

r* 

r- 

CO 

•-  a 

ro  in 

cn  ao 

O v£ 

r-  ro 

o m 

r-  x 

X it 

CN 

r~  at 

cp  vc 

r-  r- 

ao  r- 

O CD 

o o 

ID  O 

in  m. 

V 

9 9 

• • 

• • 

• • 

9 9 

9 0 

0 • 

0 0 

ro  x 

o ac 

cn  ao 

CN  CC 

1-  CD 

o x 

CN  O' 

r*  X 

p 

c* 

C' 

p- 

p- 

p~ 

r- 

nj 

H 

o 

o 

o 

o 

o 

o 

o 

0) 

o 

*—4 

CM 

ro 

't 

r* 

0 

>> 

>s 

>s 

>% 

>> 

H 

<2 

Q 

eo 

a 

Q 

eo 

a 

2 

2 

<cj 

a 

2 


Concentrations  of  Contaminant  //I  (Methane) 


13 


Table  3:  Average  Measurements  (in  ppm)  for  High  (Upper  Figure)  and  Low  (Lower  Figure) 

Concentrations  of  Contaminant  #2  (Carbon  Monoxide) 
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Concentrations  of  Contaminant  113  (R-12) 
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Concentrations  of  Contaminant  #4  (R-114) 
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Concentrations  of  Contaminant  #5  (Vinylidene  Chloride) 


Sampling  Method  I Sampling  Method  I Sampling  Method 


o © 

v©  O' 

O'  r~ 

«“  O 

Ml  CD 

o> 

rn  v0 

O'  l» 

Ml  1/1 

O 9 

Ml  O' 

*"  cr 

10  O' 

Ml  CD 

CO 

in  r- 

• • 

• 9 

• • 

• • 

9 9 

9 • 

• 9 

• 9 

M!  CD 

* VO 

10  Ml 

r-  m 

a m 

V0  Ml 

P'  Ml 

in  vo 

CD 

OD 

CO 

CD 

ao 

CO 

co 

CO 

ro  r* 

O'  r- 

^ O' 

O © 

10  O' 

fN  * 

ro  v0 

St 

®v  -3- 

o'  r- 

^ no 

V0  co 

<7v  <7v 

f*>  co 

CD  r» 

• 9 

• • 

• • 

• • 

• 9 

• • 

• • 

• • 

in  cd 

m io 

fO  in 

<0  10 

Ml  Ml 

Ml  Ml 

v0  Ml 

Ml  v0 

CD 

CD 

ao 

co 

CO 

00 

CD 

ao 

m o> 

cr  m 

0 O' 

10  co 

fN  VO 

od  ao 

ro  © 

V0  (" 

o a 

o o 

cd  r- 

ro  o 

CD  O' 

v0  CO 

«-  O' 

CO  O' 

• • 

• • 

• 9 

• • 

• 9 

• • 

• • 

9 9 

in  go 

vo  r~ 

Ml  Ml 

10  10 

*t  in 

Ml  Ml 

CD  Ml 

in  mi 

CD 

0D 

CO 

00 

CO 

co 

CD 

00 

ro  O' 

no  cd 

*-  CD 

10  Ml 

O f» 

fN  O 

r»  co 

O'  o 

ID 

cd  m 

CM  00 

OC  o 

*“  O' 

10  O' 

co  cd 

ac 

• • 

• • 

• • 

• • 

• • 

• • 

9 • 

• • 

m cd 

Ml  VO 

a Ml 

Ml  v0 

^ in 

in  in 

SO  Ml 

m v0 

OD 

00 

00 

co 

00 

CD 

CD 

co 

fN  1" 

ao  r- 

» CD 

**  VO 

r>  mi 

O O' 

st  O' 

cc  cn 

a ro 

r-  co 

P~  v£. 

r-  o 

O'  O' 

ro  Ml 

«-  0D 

vo  co 

• • 

• • 

• 9 

• • 

9 • 

• • 

• • 

• § 

IT  00 

a m 

« 0D 

Ml  00 

Ml  Ml 

NX)  O' 

m in 

O Ml 

00 

CD 

00 

OD 

co 

ao 

00 

OC 

so  m 

ro  O 

O SO 

r-  vo 

fN  r- 

O in 

CN  CN 

a ro 

r-  ro 

(N  CD 

10  3 

od  r* 

ro  00 

f-  SO 

r-  O' 

O CD 

• 9 

• 9 

• • 

• • 

• • 

• • 

• • 

• • 

in  cd 

a Ml 

^ 00 

St  M> 

vc  m 

VC  vo 

mi  in 

vo  m 

CD 

ao 

OD 

00 

CD 

oc 

0D 

co 

r-  in 

O'  r- 

r-  ro 

co  a 

(N  fN 

in  v0 

fN  ro 

v0 

St  00 

in  ro 

O CD 

in  co 

o in 

10  O' 

CM  CD 

• 9 

9 • 

• • 

• • 

• • 

• • 

• 1 

9 • 

in  cd 

cr  in 

^ CD 

in  in 

in  in 

io  r- 

a mi 

in  in. 

00 

00 

0D 

0D 

OC 

CD 

ao 

0D 

in  oo 

00  VO 

O O' 

ro  <r> 

f-  O' 

cm 

04  st 

© V- 

r-  in 

Ml  r* 

CD  St 

sr  oo 

od  r- 

m so 

<N  0D 

gd  or 

in  an 

o in 

St  00 

in  in 

m mi 

vc  r- 

in  m 

in  in 

cr 

0D 

OD 

CD 

or 

00 

or 

ar 

cm  m 

Ml  V0 

co  st 

O'  ro 

r*  ^ 

so  cr 

v0  O' 

© m 

V0  a 

rsi  m 

cm  ao 

0D  00 

vo  no 

o r- 

r-  so 

so  o 

9 9 

t • 

• • 

• 9 

• • 

• • 

• • 

• • 

r-  ' c 

if  ao 

m in 

ct  in 

in  r- 

a in 

in  M> 

in  vo 

00 

ao 

ao 

CD 

CD 

OD 

CC 

00 

m © 

o m 

oo 

in  co 

9T~  O' 

sc  ro 

sO  <N 

0s 

sO  fN 

r-  m 

00  vO 

m oo 

id  a 

O'  O' 

V0  -3 

^ r* 

• 9 

9 • 

9 9 

9 • 

• • 

9 9 

• • 

• • 

so  so 

* 0D 

st  in 

in  if 

^ r- 

ro  m 

Ml  CD 

Mi  r- 

00 

ao 

ao 

oc 

0D 

ar 

CD 

0D 

in  0s 

O cn 

m ro 

^ ON 

ro  r- 

vo  in 

h O' 

in  o 

m (N 

o m 

cd  r- 

r-~  r- 

>0  Ml 

r-  r- 

(N  CC 

O'  O' 

• • 

• • 

• 9 

9 9 

• • 

• • 

• • 

• 9 

sO  VO 

m oc 

3»  Ml 

in  in 

^ IT 

in  cc 

m in 

CC 

ao 

0D 

0D 

0D 

0D 

CD 

0D 

CD  <N 

so  o 

fN  ^ 

o :» 

Ml  O' 

St  CO 

*-  VO 

CD  fN 

St 

r*  4 

04  00 

10  CD 

m vo 

3 r~ 

O CM 

r-  O' 

r-  id 

co  ao 

in  in 

Ml  Ml 

m od 

in 

in  r- 

VO  IT 

CD 

0D 

00 

00 

co 

ac 

ao 

a 

o 

o 

o 

o 

o 

o 

o 

o 

f4 

fN 

ro 

m 

vO 

>* 

* 

>v 

5N 

►. 

<TS 

$ 

<U 

fl) 

« 

<TJ 

<0 

a 

a 

Q 

O 

a 

o 

a 

o 

Concentrations  of  Contaminant  //6  (R-113) 
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Concentrations  of  Contaminant  itl  (Hexane) 
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Concentrations  of  Contaminant  //8  (Methyl  Chloroform) 
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Concentrations  of  Contaminant  #9  (Benzene) 
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Concentrations  of  Contaminant  #10  (Trichloroethylene) 


Sampling  Method  I Sampling  Method  I Sampling  Method 
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Concentrations  of  Contaminant  #11  (Toluene) 


Two  different  assumptions  may  be  made  to  explain  this  large  vari- 
ability. The  first  is  that  the  data  in  question  indicates  experimental 
variability,  and  as  such,  should  be  included  in  the  statistical  analysis. 

The  second  assumption  is  that  something  went  wrong  in  the  experiment , with 
the  observed  data  reflecting  the  spurious  effect  of  whatever  did  go  wrong. 
Under  this  assumption,  the  flask  generating  the  suspicious  data  should  be 
dropped  from  consideration  in  the  statistical  analysis. 

Based  on  the  fact  that  these  low  readings  from  one  particular  flask 
immediately  caused  doubt  as  to  their  accuracy,  the  subsequent  analysis 
was  based  on  the  second  assumption.  Unfortunately,  this  affects  the 
balance  of  the  experiment  so  that  in  order  to  obtain  an  uncontaminated 
statistical  analysis  (i.e.,  one  in  which  there  is  no  confounding  of  effects), 
a number  of  observations  must  be  discarded. 

In  the  third  suspicious  set  of  data,  all  observations  of  carbon 
monoxide  for  sampling  method  #1  (vacuum-bake,  single  fill)  in  plot  03 
were  substantially  higher  than  observations  for  any  of  the  other  three 
sampling  methods.  This  suspicious  data  was  confined  to  this  contaminant 
only.  However,  these  high  observations  were  repeated  for  each  of  two 
flasks  which  had  been  prepared  and  analyzed  independently.  Furthermore, 
investigation  uncovered  no  anomaly  in  the  experimental  procedure  which 
would  explain  these  suspicious  readings. 

Thus,  it  was  decided  to  analyze  the  data  as  recorded,  realizing  that 
the  suspicious  observations  would  contribute  substantially  to  experimental 
variaton  (specifically  to  the  interaction  of  plot  with  sampling  method). 

For  comparison,  a similar  analysis  was  performed  using  only  the  data  of 
plots  #1  and  02.  Although  there  were  numerical  differences  in  the  results, 
the  conclusions  from  each  analysis  were  essentially  the  same. 
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B.  ANALYSIS  OF  VARIANCE 


Analysis  of  variance  breaks  out  the  total  variation  in  the  experi- 
ment and  ascribes  it  to  each  of  the  components  in  the  additive  model 
given  by  (1),  or  by  its  multiplicative  counterpart,  depending  upon  the 
one  adopted.  As  previously  noted,  each  concentration  (high  and  low) 
was  analyzed  separately  under  the  additive  model  because  of  the  large 
heterogeneity  of  variance.  For  the  multiplicative  model  (i.e.,  using 
logarithms  of  the  observations),  both  concentrations  were  analyzed 
together. 

Tables  13,  14,  and  15  summarize  the  analyses.  It  will  be  noted  that 
no  estimates  of  flask  affects  are  given  for  the  two  analyses  (Tables  14 
and  15)  which  include  low  concentrations.  This  is  the  result  of  discard- 
ing the  second  set  of  suspicious  data  and  additional  observations  to 
prevent  confounded  estimates.  Of  course,  Table  15  is  of  major  interest. 

As  can  be  seen  from  Table  15,  there  is  little  statistical  evi- 
dence that  any  of  ti.e  sampling  methods  give  different  results  for  any  of 
the  contaminants  investigated.  However,  for  most  of  the  contaminants 
there  is  a significant  time  effect,  which  warrants  further  analysis. 

The  time  effect  may  be  examined  in  a regression  framework.  That  is, 
the  observed  concentration  is  assumed  to  vary  about  a mean  value  which  is 
a function  of  time,  defined  by  a point  on  a regression  (time  trend)  line. 

In  the  current  experimental  situation,  it  is  realistic  to  assume  that  any 
time  effect  should  be  reflected  in  a decay  process,  hopefully  the  same 
for  high  and  lew.  In  other  words,  it  is  assumed  that  observed  concentration 
is  proportional  to  eat  , where  t is  the  number  of  days  elapsed  from  sampling 
to  analysis  and  a is  an  unknown  decay  parameter.  Because  of  this,  the  use 
of  the  transformed  observations  (i.e.,  logarithms)  is  indicated,  since  this 
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Table  13  (continued):  Mean  Squares  from  Analysis  of  Variance 
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Table  14:  Mean  Squares  from  Analysis  of  Variance 
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Table  14  (continued):  Mean  Squares  from  Analysis  of  Variance 


Contaminant 


* Denotes  mean  square  significant  at  0.05  level 
**  Denotes  mean  square  significant  at  0.01  level 
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permits  a direct  test  of  whether  the  time  trend  is  the  same  for  both 
high  and  low  concentrations  of  a given  contaminant  and,  if  so,  whether 
an  exponential  decay  assumption  is  reasonable. 

From  the  analysis  of  variance  for  the  transformed  data  given  in 
Table  15,  it  can  be  concluded  that  there  is  no  interaction  between  con- 
centration and  time  (the  CT  term).  From  this  it  follows  that  any  time 
trend  which  exists  is  the  same  for  both  high  and  low  concentrations.  With 
this  conclusion,  the  next  step  is  to  investigate  whether  or  not  the  time 
trend  is  of  the  form  eat  . This  is  equivalent  to  examining  the  transformed 
logarithmic  observations  for  a linear  time  trend. 

Because  of  the  equal  spacing  on  the  time  scale,  the  experimental 
variation  which  resulted  from  time  differences  may  be  easily  decomposed 
into  that  accounted  for  by  a linear  trend  and  that  accounted  for  by  higher- 
order  trends.  This  was  done  with  the  result  that,  for  all  contaminants 
for  which  there  was  a statistically  significant  time  trend,  the  linear 
trend  was  always  statistically  significant  but  the  higher-order  trend  was 
not.  This  means  that  a simple  exponential  time  trend  is  a reasonable 
assumption. 

Figure  2 exhibits  the  estimated  parameter  a for  the  nine  contaminants 
for  which  a statistically  significant  time  trend  was  found.  (There  was 
no  statistically  significant  time  effect  for  contaminant  #1  or  #6.)  The 
figure  also  indicates  the  predicted  change  in  concentration  over  a 70-day 
period. 

It  is  interesting  to  note  that  although  contaminants  #3  (R-12)  and 
#4  (R-114)  have  statistically  significant  time  trends,  the  estimated  value 
of  a is  positive,  indicating  an  exponential  increase.  Both  of  these  con- 
taminants are  refrigerants,  as  is  contaminant  #6  for  which  no  significant 
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Contaminant 

Estimated 
Value  of  a 

Predicted  Change 
after  70  Days 

#1 

— . 

#2 

-.001614 

-10.7% 

#3 

+.000404 

+ 2.9% 

#4 

+.000896 

+ 6.5% 

#5 

-.000425 

- 2.9% 

#6 

— 

— 

#7 

-.000424 

- 2.9% 

#8 

-.000546 

- 3.7% 

#9 

-.000682 

- 4.7% 

#10 

-.000697 

- 4.8% 

#11 

-.000825 

- 5.6% 

Figure  2:  Value  of  the  Estimated  Parameter 

a in  Time  Trend  eat 
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time  trend  was  found.  Based  on  these  results,  it  appears  that  the 
refrigerants  react  differently  from  the  other  contaminants  over  time. 

Figures  3 through  11  present  graphs  of  the  nine  significant  time 
trends  as  well  as  the  observed  values  at  each  time.  The  observed  values 
here  are  averages  (on  a logarithmic  scale)  over  all  samples  and  plots 
for  each  particular  time.  Also  included  are  the  upper  and  lower  95% 
confidence  bounds  on  the  estimated  time  trend. 

Because  the  value  of  a is  relatively  small,  eat  may  be  fairly  well- 
approximated  by  the  linear  equations  1 + at  . Therefore,  the  time  trend 
would  also  appear  approximately  linear  in  the  original  (untransformed) 
observations.  However,  in  order  to  take  the  initial  concentration  into 
account,  there  would  be  a different  slope  corresponding  to  each  concentra- 
tion level.  This  is  avoided  by  using  the  exponential.  Furthermore,  when 
both  concentration  levels  may  be  analyzed  together,  there  are  twice  as 
many  effective  observations  on  which  the  resulting  estimates  are  based. 

In  general,  this  provides  more  accurate  estimates. 

As  can  be  seen  from  Figures  3 through  11,  the  experimental  determina- 
tion of  the  time  trend  is  reasonably  accurate,  as  the  relatively  narrow 
confidence  bounds  attest.  Thus,  the  estimated  time  trend  may  be  used  to 
adjust  the  concentration  provided  by  an  analysis  which  is  carried  out  some 
period  of  time  after  actual  sampling. 

Suppose  that  a sample  is  analyzed  at  time  t*  (in  days).  Let  y(t*) 
denote  the  resulting  observation  and  let  w(t*)  * In  y(t*).  If,  for  the 
particular  contaminant  under  study,  the  estimated  value  of  the  parameter  a 

A 

is  a,  the  following  relationship  exists  between  w(t*)  and  w(0),  the 
(transformed)  estimated  observation  at  time  t = 0,  i.e.,  the  time  of 
sampling: 
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Time  Trend  for  Contaminant  //10 


E[w(0)]  - E[w(t*)  - at*]  . 

Thus,  as  intuition  indicates,  an  observation  at  time  t*  would  be  adjusted 
by  extrapolating  backward  along  the  regression  line  to  time  t * 0,  and 
then  untransforming  the  "logged"  observations.  That  is,  the  adjusted 
value  would  be  given  by 

y(0)  = exp [ In  y(t*)  - at*]  . 

For  example,  suppose  an  analysis  done  on  contaminant  //ll  (toluene) 
seventy  days  after  the  sample  was  taken  showed  a concentration  of  84.9  ppm. 
The  best  estimate  of  the  actual  concentration  of  toluene  when  the  sample 
was  taken  would  be: 

y (0)  - exp [ In  (84.9)  - (-.000825)  (70)] 

= exp[4. 44147  + .05775] 

* exp[4. 49922] 

- 90.0  ppm  . 

Although  this  value  (90.0  ppm)  provides  the  best  estimate  of  the 
actual  concentration  at  time  t = 0,  it  must  be  realized  that  there  is  a 
large  degree  of  variability  in  this  estimate.  This  variability  results 
not  from  the  variability  in  the  regression  line,  which  is  minimal,  but 
rather  from  the  inherent  variability  in  a single  observation.  This  can 

A 

be  seen  from  the  approximate  952  confidence  interval  for  y(0). 

A 

It  can  be  verified  that  the  variance  of  w(0)  is  given  by 

V[w(0) ] - V[w(t*)  - at*] 

“ V[w(t*)]  + (t*)  V(a)  - 2 cov[at*,w(t*) ] 

- o2[l  + (1  + ( ( t *—  35) 2 + ( t *) 2 ] /4200) /48 ] . 
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From  the  experiment  an  estimate  of  the  value  of  0 may  be  obtained 

2 

from  the  plot  x time  (PT)  mean  square.  For  toluene,  the  estimate  of  a 
is  .0022,  as  can  be  seen  from  Table  15.  Since  this  estimate  has  28  degrees 
of  freedom,  the  t-distribution  may  be  used  to  construct  a 95%  confidence 

A 

interval  for  w(0). 

Skipping  over  the  algebra,  the  resulting  95%  confidence  interval 

A 

for  the  value  of  w(0)  in  the  example  is 
(4.40185,  4.59660) 

A 

Thus,  an  approximate  95%  confidence  interval  for  y(0),  the  actual  concen- 
tration on  the  original  scale,  is 

(81.6,  99.2) 

This  relatively  wide  confidence  interval  is,  of  course,  reflecting  the 
variability  of  a single  sample.  This  interval  may  be  made  narrower, 
however,  by  obtaining  more  than  a single  sample  in  a given  situation. 

For  example,  if  the  observed  concentration  of  84.9  ppm  were  based  on  the 

A 

average  of  four  samples,  the  95%  confidence  interval  for  y(0)  would  be 
reduced  to  (approximately) : 

(87.0,  94.4)  . 

C.  VARIABILITY  OF  THE  SAMPLING  METHODS 

As  mentioned  in  the  previous  section,  the  analysis  of  variance 
indicated  that  no  differences  existed  between  the  four  sampling  methods. 

This  conclusion  pertains  to  differences  in  means.  The  analysis  of  vari- 
ance does  not,  however,  indicate  whether  the  variability  of  the  observations 
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within  any  given  sampling  method  is  different  than  the  variability  of 
another  sampling  method. 

Because  this  is  of  interest,  tests  for  the  quality  of  variance  were 

conducted  using  the  Hartley  F statistic  which  is  a ratio  of  the  largest 

max 

to  the  smallest  of  the  four  sample  variances.  In  general,  no  statistically 
significant  differences  in  variability  of  the  four  sampling  methods  were 
noted.  However,  for  some  reason,  during  the  initial  analysis  (i.e.,  at 
time  t » 0) , measurement  error  tended  to  be  smaller  for  sampling  method 
#2  (vacuum  only,  single  purge)  than  for  the  other  sampling  methods. 

D.  VARIATION  DUE  TO  FLASKS 

In  the  experiment,  the  variability  of  any  observation  is  assumed  to 
be  composed  of  a number  of  sources.  One  of  these  sources  is  variability 
resulting  from  the  fact  that  different  flasks  are  used  in  the  experiment 
and  are  assumed  to  be  a random  factor.  In  other  words,  the  flasks  are 
treated  as  a random  sample  from  a population  of  flasks,  rather  than  as 
flasks  which  are  specifically  identified.  This  treatment  is  realistic 
since,  in  general,  the  actual  flasks  used  in  sampling  are  obtained  "off 
the  shelf". 

If,  on  the  other  hand,  the  flasks  were  treated  as  a fixed  factor, 
where  each  individual  flask  had  an  unique  identity,  the  variance  of  any 
observed  observation  would  be  reduced,  because  the  singular  effect  of 
each  flask  could  be  accounted  for. 

In  the  current  experiment,  variation  due  to  flasks  appears  in  two 
components  (F  and  FT  of  equation  (1)  in  section  II. B.)  Because  of  the 
consideration  of  only  one  flask  for  low  concentrations,  estimates  of 
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these  components  were  obtained  only  for  high  concentrations.  For  all 
contaminants  other  than  //10  (trichloroethylene),  at  least  one  of  these 
two  components  was  statistically  significant,  indicating  that  using  dif- 
ferent flasks  does  add  variability  to  the  results. 

In  order  to  get  some  idea  of  how  much  random  variability  might  be 

eliminated  by  regarding  flasks  as  a fixed  factor,  the  estimate  V of  the 

F 

variability  of  an  observation  with  flasks  regarded  as  fixed  was  compared 

with  the  estimate  VR  of  the  variability  of  an  observation  with  flasks 

regarded  as  random.  (For  purposes  here,  the  variation  of  interest  is 

about  the  plot  mean.  Thus,  variation  due  to  plots  was  not  included  in 

the  calculations.)  A rough  estimate  of  how  much  the  variability  may  be 

decreased  is  given  by  the  ratio  (V  - V )/V  which  ranges  from  a low  of 

R F R 

62  for  contaminant  //2  (carbon  monoxide)  to  a high  of  77%  for  contaminant 
#8  (methyl  chloroform). 
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IV.  SUMMARY  AND  CONCLUSIONS 


The  two  main  topics  addressed  in  this  experiment  were  (1)  whether 
differences  existed  in  the  performance  of  the  four  grab  sampling  methods 
used,  and  (2)  whether  the  grab  samples  have  a shelf  life,  i.e.,  whether 
a delay  in  analyzing  the  samples  affects  the  results.  The  statistical 
analysis  of  the  experimental  data  indicates  that  each  of  the  four  sampling 
methods  performs  equally  well.  Thus,  the  choice  of  one  of  these  grab 
sampling  methods  should  be  made  on  some  other  basis,  such  as  the  time 
and  costs  associated  with  preparation  and  actual  sampling. 

Although  the  experiment  did  not  reveal  an.,  differences  in  grab 
sampling  methods,  it  did  indicate  that  the  samples  do  have  a shelf  life. 

For  all  contaminants  except  the  refrigerants  and  methane,  a statistically 
significant  exponential  time  decay  is  present  in  the  shelf  life  of  a 
sample.  Although  decay  parameter  values  are  not  the  same  for  each  contam- 
inant, the  parameter  value  is  independent  of  the  concentration.  For  two 
of  the  refrigerants  (R-12  and  R-114) , a statistically  significant  exponential 
time  growth  was  found  in  the  shelf  life,  again  independent  of  concentration. 
No  statistically  significant  time  effect  existed  for  the  remaining  refrig- 
erant (R-113)  or  for  methane.  For  those  contaminants  which  exhibited  a 
statistically  significant  time  effect  in  shelf  life,  a better  estimate  of 
the  actual  contaminant  concentration  at  the  time  of  sampling  may  be  obtained 
by  adjusting  the  observed  value  as  indicated  in  the  previous  section. 
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